Measuring stationarity in long-memory processes 



Kemal Sen, Philip PreuB, Holger Dette 
Ruhr-Universitat Bochum 
Fakultat fiir Mathematik 
44780 Bochum 
Germany 
email: kemal.sen@ruhr-uni-bochum.de 
email: philip.preuss@ruhr-uni-bochum.de 
email: holgcr.dette@ruhr-uni-bochum.de 

March 15, 2013 
Abstract 

In this paper we consider the problem of measuring stationarity in locally stationary long- 
memory processes. We introduce an L 2 -distance between the spectral density of the locally 
stationary process and its best approximation under the assumption of stationarity. The dis- 
tance is estimated by a numerical approximation of the integrated spectral periodogram and 
asymptotic normality of the resulting estimate is established. The results can be used to con- 
struct a simple test for the hypothesis of stationarity in locally stationary long-range dependent 
processes. We also propose a bootstrap procedure to improve the approximation of the nominal 
level and prove its consistency. Throughout the paper, we will work with Riemann sums of a 
squared periodogram instead of integrals (as it is usually done in the literature) and as a by- 
product of independent interest it is demonstrated that the two approaches behave differently 
in the limit. 
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1 Introduction 

The assumption of (second-order) stationarity is quite common in the analysis of time series data 
like wind speeds, computer network traffic or stock returns. This condition allows for a well 
developed statistical analysis, and there exist numerous books and articles dealing with parameter 
estimation or forecasting techniques. However, under the assumption of stationarity many real 
world phenomena can only be described by complicated and less intuitive models. A typical example 



Figure 1: Left panel: log-returns of the IBM stock between June 9th 2004 an d July 24th 2012, 
middle panel: ACF of the log-returns Xt, right panel: ACF of the squared returns Xj* 



can be found in the left panel of Figure [T] which shows 2048 log-returns of the IBM stock between 
June 9th 2004 and July 24th 2012. We observe that the autocovariance function (ACF) ~f{k) = 
Cov(Xo, Xk) of the log-returns Xt is converging to zero very "fast" as k — > oo, while this is not the 
case for the ACF of the squared returns Xf [see the middle and right panel in Figure [lj. The latter 
effect serves as the usual motivation to employ stationary long-memory models in the analysis of 
stock volatilities [see |Breidt et al.] fll998[ )]. This means that stationary processes satisfy 
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(1.1) 



for some d £ (0,0.5), which is called the long-memory parameter. Examples which fit into this 



framework are the well-known FARIMA(p, d, g)-models introduced by Granger and Joyeux (1980) 



and Hosking (1981). However, these kinds of processes are not very intuitive and it was suggested 



by several authors that one should use simple but non-stationary "short-memory" models instead 



[see for example Mikosch and Starica (|2004|), Starica and Granger (2005), Fryzlewicz et al. (J2006) 
or 



Chen et al. (2010) among others]. Therefore an important question of interest in this context is, 



if the data should be analyzed by a stationary long-range dependent model or by a non-stationary 
"short-memory" model. 

In the present paper we propose a measure of stationarity in long-range dependent locally stationary 
processes, which is used for the construction of a consistent test for the hypothesis of stationarity. 
Since the assumption of stationarity is crucial in the application of various statistical tools, there 
exist several procedures to validate this condition in the context of short-memory processes. A first 



test for stationarity in locally stationary processes [as introduced by Dahlhaus (1997)] was proposed 



by von Sachs and Neumann (2000) and is based on the estimation of wavelet coefficients by a 



localized version of the periodogram. Paparoditis (2009, 2010) suggested an L2-distance between 



the estimated spectral densities under the assumptions of stationarity and of local stationarity, and 



Dwivedi and Subba Rao (2010) developed a Portmanteau-type test statistic to detect deviations 



from stationarity. Besides the choice of a window width for the localized periodogram, which is 
inherent in essentially any statistical inference for locally stationary processes, all these methods 
require the choice of at least one additional smoothing parameter, such as the order of the wavelet 



expansion, a bandwidth for the estimation of the spectral density or the order in a Portmanteau- 
type test. Dette et al. (2011a) developed tests for stationarity in the framework of locally stationary 
processes which only require the choice of one regularization parameter, namely the window length 
for the localized periodogram in the estimation procedure. These authors considered the L2- 
distance 
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(1.2) 



between the time varying spectral density f(u, A) and its approximation A 1— > f f(v, X)dv through 
a spectral density corresponding to a stationary process. It is easy to see that the process is 
stationary (i.e. the time varying spectral density does not depend on u) if and only if D 2 = 0, and 
D 2 can be considered as a measure of deviation from stationarity in the frequency domain. This 



quantity corresponds to the measure used in Paparoditis (2009), but unlike to this author, Dette 
et al. (2011a) estimated D 2 directly via Riemann sums of the (squared) local periodogram instead 



of a smoothed local periodogram and thus avoided the choice of a smoothing parameter. Preufi 



et al. (2012) proposed an alternative measure for deviations from stationarity which is based on 



the Kolmogorov-Smirnov distance 
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[see also Dahlhaus (2009)]. Both approaches have their pros and cons. In particular tests based on 



the distance (1.3) are vT-consistent (here T denotes the sample size). On the other hand it is well 
known that - although such tests are consistent against alternatives converging to the null hypothesis 
at a parametric rate - Kolmogorov-type and related tests greatly weigh down contributions from 



high frequency components [see Ghosh and Huang (1991), Eubank and LaRiccia (1992) or Fan 



(1996)]. Moreover, the limiting distribution of Kolmogorov-Smirnov-type test statistics is usually 
not known. In principle this problem can be solved by bootstrap methods. However in many cases 
this yields to a loss of power. On the other hand, tests based on the L2-approach can often use 
critical values from the standard normal distribution. 

As all procedures which have been suggested so far for discriminating between stationarity and non- 
stationarity, the tests proposed by Dette et al. (2011a) and PreuB et al. (2012) are only applicable 
to short-memory processes, and the development of a corresponding methodology in the context of 
long-range dependence is missing. In fact, although stationary long- memory models are employed 
numerously in practice, there do not exist many tests for the hypothesis of stationarity which 



include these processes. Berkes et al. (2006), Sibbertsen and Kruse (2009) and Dehling et al. 



(2011) consider CUSUM and Wilcoxon type tests to discriminate between long-range dependence 
and one change in mean. However a change with respect to the mean is of course only the simplest 
possible deviation from stationarity and there is particular interest in measuring deviations in the 
dependency structure over time as well. 

Recently, PreuB and Vetter (2012) developed a test for stationarity which includes the long-range 
dependent case and is based on the distance (1.3). As mentioned in the previous paragraph there 
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exist several situations where this approach is not the best and for this reason we consider in this 
paper an alternative test which is based on the measure defined in (1.2). For this purpose, we 
estimate the integrals in the distance D 2 in (1.2) by Riemann sums where the unknown spectral 
densities are replaced by periodograms. For the resulting statistic we will show that an appro- 
priately standardized statistic converges to a standard normal distribution if the (possibly time 
varying) long-memory parameter d(u) is smaller than 1/8. These results are used to develop a 
bootstrap procedure for the approximation of the limit distribution and to prove its consistency in 
the general case. 

Although that the proof of asymptotic normality seems to be more of theoretical nature, because the 
bootstrap procedure derived in the second part of the paper can in principle also be applied in the 
case d{u) < 1/8, these results are of interest from several perspectives. Firstly, several arguments 
used in the proof of asymptotic normality are also required in the proof of consistency of the 
bootstrap procedure and easier illustrated in the unconditional case. Secondly, and most important, 
the estimate D\ of D 2 is based on estimates of the integrated and integrated squared spectral 
density L f(u,X)du and f 2 (u, A) dXdu, respectively. For this purpose we use Riemann 

sums of the squared periodogram instead of not computable integrals as it is usually done in the 
literature [see Taniguchi (1980), Fox and Taqqu (1987) and Palma and Olea (2010) among others]. 
Although one might expect that both estimators exhibit a similar behavior with respect to weak 
convergence, it is demonstrated in Section [3] that this is not the case in the present context. A 
similar observation was also made by Deo and Chen ( 2000 ) in the case of short-memory stationary 
processes. To the best of our knowledge, even in the (much simpler) stationary case, Riemann sums 
of a squared periodogram have not been considered in the literature for the long-range dependent 
case. 

The remaining part of this paper is organized as follows: In Section 2, we introduce the necessary 
notation and define an empirical measure of stationarity. In Section [3j we prove that an appropri- 
ately standardized version of this measure converges weakly to a standard normal distribution if 
the time varying long-memory parameter is smaller than 1/8. In Section 4, we present a bootstrap 
procedure to approximate the distribution of D 2 -, and prove its consistency. The finite sample prop- 
erties are investigated in Section 5. Finally, we defer all technical details to appendices in Section 
6 and 7. 



2 Measuring stationarity in locally stationary long-memory pro- 
cesses 



In order to obtain a measure of stationarity including the long-range dependent case, we require a 
set-up which is flexible enough to cover stationary long-memory processes and a reasonable time- 
varying extension of it as well. For this reason, we consider the following theoretical framework of 



a locally stationary long- memory process [similar approaches can be found in Beran (2009), Palma 



and Olea (2010) and Roueff and von Sachs (2011)]. 
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Assumption 2.1. Let {X t ^}t=i,...,T denote a sequence of stochastic processes which have a MA(oo) 
representation of the form 



X t ,r = ^2 ip t ,T,iZ t -i, t = 1, . . . , T, 



(2.1) 



1=0 



such that 



sup sup yZ^t, T ,i<oo, 
T&ite{i,...,T}^ 



(2.2) 



where {Z t }t^z are independent and standard normal distributed random variables. We further 
assume the following conditions. 



1 ) There exist twice continuously differ entiable functions ipi : [0, 1] — >■ M (I G Z) such that 

C 



sup 

t=l,...,T 



< 



and 



a(u) 



/(j)i-d(u) 



TI(l) 



O 



l-d 



Vie IN 



(2.3) 



1(1) 



2-d a 



(2.4) 



holds uniformly in u as I — > oo, where d,a : [0,1] — > M + are twice differ entiable functions, 
C G M + and doo = sup ug [ 01 ] d(u) are constants and I[x) := \x\ • ljz^o} + l{x=o}- 



2) The time varying spectral density 

1 

f(u,X) := — \^2^i(u) e-Kp(-iM) 



1=0 



(2.5) 



is twice continuously differentiable on (0,1) x (0,7r). Moreover, f(u,X) and all its partial 
derivatives up to order two are continuous on [0,1] x (0,7r]. 



3) There exists a constant C G M + , which is independent of u and X, such that for I / 

sup < , 

ue(o,i) I'I 

Clog 2 IZI 

sup W{u)\ < f^ 1 . 

«G(0,1) W °° 



In addition, we assume 



sup 

ue(o,i) 



sup 

ue(o,i) 



d_ 
du 

du 2 



f(u,X) 
f(u,X) 



< 



< 



Clog(A) 

\2dcv ' 
Clog 2 (A) 

\2doc 



(2.6) 
(2.7) 

(2.8) 
(2.9) 
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For the sake of a transparent notation, we will use C E M + as a universal constant throughout this 



paper. Note that the process is stationary if ipitT = fpi for all l,t,T E IV. Condition (2.2) ensures 



that the infinite sum in (2.1) exists in the L2 sense, and (2.3) means that the process X^t c an be 



approximated by a stationary model on a small time interval. It is also worthwhile to mention that 
the assumption of Gaussianity is only imposed here to simplify technical arguments [since they are 



quite involved even in this case]; see Remark 3.5 for more details. Next, we consider the process 

(2.10) 



in order to visualize some properties of a locally stationary long-memory process. Firstly, Xt(u) 
is stationary for every fixed u G [0, 1] and analogously to the stationary case, the condition ( |2.4[ ) 
implies the existence of bounded functions yi : [0, 1] — > K (i = 1, 2) such that 



\Cav{X t (u),X t+k (u))\ 



2/1 C") 

y.l-2d(u) 



as k 



DC 



and 



f(u,\) 



V2{u) 

X 2d(u) 



as A — > 0; 



(2.11) 



sec 



i Palma and Olea (2010)]. Consequently, the autocovariance function j(u, k) = Cov(Xq(u), Xk(u)) 
is not absolutely summable and the time varying spectral density f(u,X) has a pole at A = for 
any u G [0, 1]. 

As an example which fits in this theoretical set-up we consider the time varying FARIMA(j>, d, q) 
model defined by the equation 



a(t/T,B)(l - B) d ^X t , T = b(t/T,B)Z t , t = 1,...,T, 



(2.12) 



where B denotes the backshift operator 
a(u, z) := 1 

for given functions dj, bj : [0, 1] — > M, and d : [0, 1] 



aj(u)z\ b(u, z) :- 



i=i 



u )z J 



(0, doo] is twice continuously differentiable with 
doo < 1/2. It is shown in PreuB and Vetter (2012) that under certain regularity conditions on the 



functions aj, bj, these equations have a locally stationary solution in the sense of Assumption 2.1 If 
the functions a,- 



ij, bj and d do not depend on u, (2.12) corresponds to the common FARIMA(j>, d, t 



equation [see for example Palma (2007) for conditions for the existence of a solution], which is 



included in our theoretical framework. 



For the construction of an estimate of the quantity (1.2) we note that 

D 2 = 2vrFi 



where 



1 

2^ ./„ 



4vrF 2 , 



f 2 (u, X)d\du, 



(2.13) 



(2.14) 
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f(u,X)du\ dX. 

l2 ■ 



(2.15) 



Consequently, it follows from (2.11 ) that the distance D is only well defined if d^ < j. We assume 



without loss of generality that the sample size T can be decomposed into M blocks with length N 
(i.e. T = NM) where N and M are positive integers and N is even. A rough estimator for the 
time varying spectral density f(u, X) is then given by the local periodogram at the rescaled time 
point u £ [0, 1] which is defined by 



N-l 



In(u,X) :-- 



2ttN 



^2 x [uT\-n/2+i+s,t exp(-iXs] 



where Xj^ = for j {1, . . . , T} [see Dahlhaus (1997)]. This is the usual periodogram computed 



from the observations X\ u rp\-N/2+i,T> ■ ■ ■ ; x [uT\+n/2,Tj an d it can be shown that it is asymptotically 
unbiased for the time- varying spectral density f(u, X) if N — > oo and = o(T). However, In(u, X) 
is not consistent just as the usual periodogram. In addition, In(u,X) 2 is an unbiased (but not 
consistent) estimate of the quantity 2/ 2 (n, A) instead of f 2 (u, A). 



We now construct empirical versions of (2.14) and (2.15) by replacing the integrals through appro- 
priate Riemann-sums and substitute f(u, A) and f 2 (u, A) by In{u, X) and In{u, A) 2 /2, respectively. 
For this purpose, we define the rescaled mid-points of the M blocks 



T 



N(j 



N/2 



(j = l,...,M) 



and consider the statistics 



m Lf J 



1 



j=l k=l 

LfJ 



*2,T 



1 / 1 M 

xY.{M^ lN{UjAk > N) 



(2.16) 



(2.17) 



k=l 



where Xk,N = 2irk/N denote the usual Fourier frequencies. The empirical measure of stationarity 



(1.2) is finally given by 



t)\ := 2ttF 1jT - AirF 2 , T - 



(2.18) 



We would like to point out here that it is far from obvious that l) 2 -, is a consistent estimator of D 2 . In 
general it is not true that an integrated function of the periodogram converges to the corresponding 
integrated function of the spectral density. This - at a first glance - is a counterintuitive property 



of the integrated periodogram and was already observed by Taniguchi (1980) in the context of 



stationary short-memory processes. These problems are also visible here as well as we require a 
multiple of In(u, A) 2 to obtain an asymptotically unbiased estimator for f 2 (u, A). In the following 
section we will prove consistency of D\ and study its weak convergence. 
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3 Consistency and weak convergence 

Throughout this paper, the symbols — > and — > denote convergence in probability and weak 
convergence, respectively. In order to specify the bias of F\ t T and i*2,T we define 

j=l J ~* J -* j=l 

and obtain the following results. 



Theorem 3.1. Suppose Assumption 2.1 holds with sup ug [ ^ d(u) < 1/4 and t/ioi i/ie conditions 



N 

iV^oo, (3.1) 



«. -p -p 
are satisfied. Then F\ 7 t ► i*2,T — ^ ^2 ^ n particular 



D 2 ^D 2 



as T — > oo. 



Theorem 3.2. Suppose Assumption 2.1 holds with doo = sup ug r 0) i] d{u) < 1/8 and that the condi- 
tions 



N 

N -> oo, — -> 0, * . . ► (3.2) 

are satisfied. Then as T — > oo we /iat>e 

VT{(A,T, - (Fl,T, F 2 ,T + <*7V,T) T - c T } AA(0, £), 

where the covariance matrix S and £/ie constant d^ t T are given by 

f £ /*(«> X ) dudX I (Jo /(«> A)** £ / 3 (n, A) du) dA 



|/! 1 r(/o 1 /(«,A)Ai/ 1 / 8 (ti,A)Ai)dA J (J? /(«,*)**) £ / 2 (u, A)du)dA 



(3.3) 



d7V ' T = 4rf>^ ' />^)dA, (3.4) 



7=1 • /_7r 



respectively, and the vector Ct £ -K 2 is o/ order O (N 2 /T 2 + log(iV)/(MA^ 1_4d ° )) . Jn particular, 
this term vanishes if the functions ipi(u) are independent of u for all I £ Z, [i.e. the spectral density 
f(u, A) o/ the underlying process Xt t T * s independent of uj. 



A similar result for the short-memory situation has been derived by Dette et al. (2011a). In contrast 



to their result, there appears an additional bias term Ct in Theorem 3.2 This term is negligible 
under the additional condition ,/V 2 /T 3 / 2 — > which holds under the stronger restriction d^ < 1/12 
due to (3.2). On the other hand, under the null hypothesis of a time independent spectral density 

Hq : f(u, A) is independent of u, (3-5) 



8 



we have that Ct = (this follows from the proof of Theorem 3.2 in the Appendix). Since the 
covariance matrix (3.3) contains the integrated fourth power of the spectral density, we obtain 
from (2.11) that Theorem 3.2 is not valid whenever doo > 1/8. Writing (Ci : t,C2,t) T '■= Ct, a 
straightforward application of the Delta- method yields the following result. 



Corollary 3.3. Under the assumptions of Theorem 3.2, it holds 



T[D T -D T + And NtT + AttC 2 ,t - 2ttC x ,t 



v 



(3.6) 



where D T := 2irF\ T — ^ttF^t and the asymptotic variance is given by 



207T 
+ 167T 



TT JO 



f(u,\)dud\-32TT j [J f{u,\)du f(u,X)du)dX (3.7) 



1 N2 
f(u, X)du 



o 



f 2 (u,X)du )dX. 



Under the null hypothesis (3.5) we have D T = C\ t = C2,T = and the asymptotic variance in 
(3.7) reduces to rfj Q := Air f 4 (X)dX. The asymptotic bias Aird^^ = ^j^-Fi t T can easily be 
estimated by the statistic Bt '■= ^^-Fi,t and we infer from Theorem 



3.2 



T Bt — 4itdN t 



2 ^Vt(f 1 t-f 1 , t 



V 



0. 



Thus Slutzky's Lemma together with (3.6) yields 



T[D T + B T 



v 



Af(o,4 o ; 



(3.8) 



under the null hypothesis. To construct an asymptotic level a-test for stationarity, it therefore 
remains to estimate the variance r^ Q in (3.8), and an estimator for this quantity is given by 
ffj o := 4ir 2 ff with 



1 Lfj M 

k=l j=l 

The consistency of this estimator follows from the next theorem. 



Theorem 3.4. // the assumptions of Theorem 3.2 are satisfied, we have 



^2 V 



TT fl 



/ 4 (u, X)dudX. 



TT JO 



Combining (3.8) with Theorem 3.4 yields 

f( t) T + B T 



v 



AA(0, 1) 



(3.9) 



and therefore an asymptotic level a-test is obtained by rejecting the null hypothesis (3.5) whenever 

T[D 2 T + B T ) / ^ > Ul - a , (3.10) 

where u\_ a denotes the (1 — a)-quantile of the standard normal distribution. It follows from 
Theorem 3.2 that this test is consistent, because the left hand side of (3.10) converges to infinity, 
whenever there exists a A S [— 7r, tt] such that the function u \-t f(u, A) is not constant. 
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Remark 3.5. If the innovation process (Zt)tez in (2.1) is not Gaussian, it can be shown that 
Corollary 3.3 is still valid where the asymptotic variance t 2 in (3.7) has to be replaced by 



«4 



f 2 (u,X)dX) du + A 



1 x v 2 

f(u, A) du dX du 



o 



f 2 (u,X)dX f{u,X)[ / f{v,\)dv )dX )du 



and K2 and K4 denote the second and fourth cumulants of the innovation process, respectively. In 
particular, under the null hypothesis of stationarity, it follows that r 2 = r 2 = t^ o and hence no 
adjustments in the asymptotic level a-test in (3.10) are necessary to address non normal distributed 
innovations. 

Remark 3.6. We note that for locally stationary long-range dependent models the asymptotic 
variances of the statistics 



'1,T 



i —Y^ri N {u v xfdx 



AttM 



3=1 



and of Fi x 7 defined in (2.16), are different. In fact it follows by similar arguments as given in the 
appendix that 

fir r l 



while 



14 

lim TVar(Fix) = — 
T^oo v ' ' 3vr 



lim T Varfiq T ) = - 



/ (u, X)dudX, 



7T Jo 



7T r i 



f 4 (u, X)dudX 



IT JO 



by Theorem 3.2. Moreover, similar arguments as given in the proof of this statement show that 
even in the stationary case the asymptotic variance of the statistic It(X) 2 dX and its discretized 
version (2tt/T) Ylk=i ^T(Afc,r) 2 are not the same (here It(A) denotes the usual periodogram and 
Xk,x = 2nk/T are the Fourier frequencies). Deo and Chen (2000) observed the same effect in the 
context of stationary short-memory processes. 



4 Critical values by resampling 



2.1 



We will show 



We now consider the more general set-up with doo < \ as specified in Assumption 
that in this case a bootstrap procedure can be used to approximate the distribution of I) T under 
the null hypothesis (3.5). We employ the FARI(oo) bootstrap which was recently introduced by 
Preufi and Vetter (2012) and fits an FARIMA(p, d, 0)-model to the data, where p = p(T) converges 



to infinity with increasing sample size T. To prove consistency of this procedure, we require the 
following technical assumptions. 
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Assumption 4.1. For the stationary process {Xt}tez with strictly positive spectral density A i— > 
Jo /( u > tydu, there exists a constant d 6 (0, 1/4) such that the process 

Y t = (l- BfX t (4.1) 

has an AR(oo) -representation of the form 

oo 

Y t = J2a j Y t - S + Zf R , (4.2) 

3=1 

where {Zj}jez denotes a Gaussian White Noise process with variance a 2 > and the coefficients 
in the representation (4.2) satisfy 

oo 

X>il|j| 7 <oo, (4.3) 
i=i 

oo 

1-J2 a i zj ^° /° r M ^ L ( 4 - 4 ) 

Note that under the null hypothesis of a time independent spectral density, it follows that d = doo = 
d(u) for all u £ [0, 1], but under the alternative we usually have d ^ doo. The FARI(oo) bootstrap 
incorporates the following steps: First we choose a p = p(T) E IN to construct an estimator, say d, 
of the long-range dependence parameter d in model (4.1). Secondly we calculate an estimator of 

( ^ \ 2 

(a ljP , a PiP ) = argmm Ef Y t - ^ h^ p Y t ^A , (4.5) 

by fitting an AR(p)-model to the data. In order to describe the main idea of our procedure in more 
detail, we introduce the "true" approximating process Y t AR (p) by 

Y t AR (p) = j2HpYt^(p) + Zf* (4-6) 

3=1 



where the parameters aj iP are defined in (ULo) and {Z AR } t£ll is a Gaussian White Noise process 
with mean zero and variance a p = 1E(Y" 4 — Y^=i a j,p^t-j) 2 - If p = p(T) — > oo the process Y AR {p) 
approximates Yj and therefore (1 — B)~-Y AR (p) is "close" to the stationary process Xt whose spec- 
tral density is given by A H > Jq f{u, X)du. Under the null hypothesis of stationarity, this function 
coincides with the spectral density of {^t,r}t=i,...,T- Hence, observing the data Xi t x, ■■■,Xt ! t, the 
FARI(oo) bootstrap precisely works as follows: 



1) Choose p = p(T) G IN and calculate 9t, p = (d, a p , di iP , a P)P ) as the minimizer of 

T/2 

T 



ii 



where P = (d, a p , ai jP , a PiP ), 

i-' 

2tt7' 

is the usual periodogram, and 



1 I T 

MA) = — =E X *.reacp(-»At) 



t=i 



|l-expHA)|- 2 ^ 
•M A ) = SZ x 



2vr |1 - Y%=i Oj,pexp(-iAj)| 2 

is the spectral density of a stationary FARIMA(p, d, 0)-model. Note that the estimator 9t p 
is the classical Whittle estimator of a stationary process [see Whittle (1951)]. 

2) Calculate Y^t = (1 — B) Xt,T for t = 1,...,T and simulate a pseudo-series Y* T , Yj, T 
according to the model 

Y* T = Y t , T - t = l,...,p, 
p 

Kt = E ^P Y t- h T + & P Zj, P<t<T, 

3=1 

where Zj denotes an independent sequence of standard normal distributed random variables. 

3) Create the pseudo-series X* T , ...,X%, T from the equation 

X* T = (l-B)-iY* T (4.7) 

and compute Dj,* in the same way as t)\ where the original observations X\^ y ••• j ^t,t are 
replaced by the bootstrap replicates X* Tl ...,Xj, T . 

Our main theorem in this section describes the theoretical properties of this procedure. 



Theorem 4.2. Assume that the null hypothesis (3.5) holds and let Assumption 2.1 and 4-1 be 
fulfilled. Furthermore, suppose that the conditions 

N T 

N^oc, -^0, (4.8) 

are satisfied for some < 5 < 1/2, and assume for the growth rate (rate of convergence) ofp = p(T) 
the following: 

i) There exist sequences p max {T) >p m in{T) T ^° D > oo such that p{T) G [p m i„{T),p max (T)], 

pLxmiog(T) 3 iV 5 r- 1 = 0(l), (4.9) 
Vfp m 9 m (T)/y^og(T) = o(l). (4.10) 

ii) The condition 



\\hv-^\U = Op\^J l ^-j (4.11) 

is fulfilled uniformly with respect to p, where 6t, p denotes the estimator used in step 1) of the 
bootstrap procedure and 8 p = (d p , cr 2 , ai )P , ...,a P)P ) are the corresponding "true" parameters. 
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9 2 * 

Then there exist random variables D^, „ and such that 



T,a 



(a) 
(b) 



h 2 - n 2 >* 
Var^)- 1 / 2 ^. 



D 2 T , a 



D 



2.* 



(d) ~E\D, 



2,* I 
T,a\ 



o 



T,a 

jymax(4d-l/2,0) 



op(1), 
= o P (l), 

Vlog(A r )l{d = i/8} 



1 



iV 



l-4d 



T/ie estimate in (d) also holds if the null hypothesis (3.5) is no£ satisfied. 



Note that conditions like (4.9)-(4.11) are standard in the context of parametric bootstraps [see for 



example Berg et al. (2010) or Kreifi et al. (2011 )] and a detailed discussion of them is given in Preufi 
(2012). We now obtain an asymptotic level a-test based on D 2 -, as follows: Calculate B 



and Vetter 



bootstrap replicates D 2 ^*, denote by (D 2 ^*)t,i 1 (D 2 ^*)t,b the resulting order statistic and reject 
the null hypothesis whenever 



D* T > (Z4'>,L( 



l-a)B\ ■ 



(4.12) 



Theorem 4.2 and the argumentation in Paparoditis (2010) indicate that this procedure is valid 



for obtaining an asymptotic level a-test. In order to prove this more formally, we follow Bickel 



and Freedman 



(1981) by considering the Mallow metric ^(i 7 , G) = inf y / 'Ei(X — Y) 2 between two 
distributions F and G, where the infimum is taken over all pairs (X, Y) of random variables with 
marginal distributions F and G. Theorem 4.2 then yields the following result which states that the 



test (4.12) has, in fact, asymptotic level a. 



Theorem 4.3. Suppose the null hypothesis (3.5) and the assumptions of Theorem 4-2 are satisfied. 
Then, as T —> oo, the Mallow distance di between the distributions of the random variables 



b^/^X&vib 2 ,) and D 2 / /^Y&r{D^ 
converges to zero in probability. 



2.*N 



Consistency under the alternative follows since Theorem 4.2 d) yields that each bootstrap statistic 
D 2 ^* converges to zero while b\ exceeds some positive constant (for T sufficiently large) due to 
Theorem 13.11 



5 Finite sample properties 



In this section we examine the finite sample properties of the proposed decision rule (4.12). An 



important problem is the choice of the window length iV for the calculation of the local periodogram 
and the choice of the AR parameter p in the bootstrap procedure. Throughout this section we 



choose p as the minimizer of the AIC criterion [see Akaike (1973)], which is defined by 
p = argmin p — ^ ( log j^) (Afc.r) + 



k=l 



+ 



p 



13 



in the context of stationary processes due to 



Whittle 



(1951) [here is the spectral density of 



the fitted stationary FARIMA(p, d, 0) process and It is the usual stationary periodogram] . We 
therefore restrict ourselves to an analysis of the sensitivity with respect to N in the following, and 



it will turn out that the test (4.12) using the FARI(oo) bootstrap exhibits a remarkable robustness 
with respect to the choice of N. All reported results of this section are based on 1000 simulation 
runs and 200 bootstrap replications. 



5.1 Size of the test 

In order to investigate the approximation of the nominal level we simulate data from the 
FARIMA(1, d, 0) model 



and the FARIMA(0, d, 1) process 



(l-cj)B)(l-B) d X t = Z t 



(1 - B) d X t = (1 + 0B)Z t 



(5.1) 



(5.2) 



for different values of (j>, 6 and d where the random variables Z% are independent standard normal 



distributed. The rejection probabilities for the bootstrap test (4.12) are displayed in Table [lj|4| 
where d S {0.1,0.2}. We observe a very precise approximation of the nominal level in nearly all 
cases which is rather robust with respect to different choices of the parameter M and N. 



In order to study the power of the test we consider the following alternatives 

(1 -B) d X t)T = Zt + 0.8cos(1.5-cos(47rt/T))Zt_i ) (5.3) 
(1 - 0.6sin(47rt/T)5) (1 - B) d X tiT = Z t , (5.4) 
(1 - B) d X t .T = Vsin(7rf/T)Z t , (5.5) 

where d = 0.2. These kinds of alternatives were investigated by several authors in the context 



Paparoditis 


(2010 


) and 


Dahlhaus 


(1997) 



rejection frequencies for the bootstrap test (4.12) are depicted in Figure 2H4 for different combi- 



nations of T and N. Additionally, the results for the Kolmogorov-Smirnov approach of Preufi and 



Vetter (2012) are presented. We observe that the new procedure clearly outperforms the test of 



PreuB and Vetter (2012) for the models (5.3) and (5.4) while the Kolmogorov-Smirnov test works 



better for the process (5.5). In addition, we observe that the new decision rule is less sensitive with 



respect to different choices of N than the test based on the Kolmogorov-Smirnov distance. 



Acknowledgements. This work has been supported in part by the Collaborative Research Center 
"Statistical modeling of nonlinear dynamic processes" (SFB 823, Teilprojekt Al, CI) of the German 
Research Foundation (DFG). 
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<t> = 


-0.9 


4> = 


-0.5 


</> = 


4> = 


0.5 


= 


0.9 




T 


N 


M 


5% 


10% 


5% 


10% 


5% 


10% 


5% 


10% 


5% 


10% 


Al 


128 


16 


8 


.126 


.182 


.072 


.123 


.036 


.074 


.073 


.126 


.084 


.147 


A2 


128 


8 


16 


.140 


.200 


.085 


.132 


.041 


.090 


.073 


.128 


.084 


.118 


Bl 


256 


32 


8 


.065 


.135 


.064 


.119 


.042 


.088 


.062 


.113 


.075 


.147 


B2 


256 


16 


16 


.080 


.132 


.056 


.108 


.040 


.082 


.051 


.109 


.062 


.109 


B3 


256 


8 


32 


.068 


.111 


.045 


.095 


.046 


.097 


.072 


.147 


.049 


.114 


CI 


512 


64 


8 


.054 


.109 


.049 


.106 


.039 


.089 


.049 


.114 


.082 


.134 


C2 


512 


32 


16 


.038 


.093 


.043 


.086 


.039 


.085 


.059 


.108 


.065 


.132 


C3 


512 


16 


32 


.061 


.095 


.051 


.102 


.045 


.081 


.059 


.109 


.043 


.104 


C4 


512 


8 


64 


.060 


.107 


.053 


.098 


.045 


.083 


.060 


.116 


.042 


.093 


Dl 


1024 


128 


8 


.039 


.104 


.042 


.093 


.042 


.085 


.035 


.093 


.079 


.132 


D2 


1024 


64 


16 


.053 


.104 


.058 


.097 


.050 


.110 


.057 


.101 


.068 


.126 


D3 


1024 


32 


32 


.033 


.076 


.058 


.114 


.046 


.086 


.070 


.107 


.062 


.115 


D4 


1024 


16 


64 


.046 


.089 


.036 


.091 


.044 


.084 


.054 


.109 


.044 


.099 


D5 


1024 


8 


128 


.037 


.073 


.041 


.091 


.041 


.091 


.061 


.131 


.045 


.097 



Table 1: Rejection probabilities of the bootstrap test (4.12) under Hq for different choices of T,N 



and M. The data was generated according to model (5.1) with d = 0.1 and different values for <f>. 







<t> = 


-0.9 


4> = 


-0.5 


= 


4> = 


0.5 


<$> = 


0.9 




T 


N 


M 


5% 


10% 


5% 


10% 


5% 


10% 


5% 


10% 


5% 


10% 


Al 


128 


16 


8 


.107 


.164 


.063 


.114 


.050 


.108 


.072 


.121 


.108 


.166 


A2 


128 


8 


16 


.106 


.160 


.064 


.118 


.041 


.085 


.073 


.124 


.078 


.138 


Bl 


256 


32 


8 


.064 


.123 


.048 


.104 


.042 


.094 


.075 


.131 


.079 


.137 


B2 


256 


16 


16 


.058 


.125 


.051 


.101 


.040 


.101 


.065 


.112 


.055 


.116 


B3 


256 


8 


32 


.079 


.124 


.047 


.089 


.051 


.091 


.053 


.106 


.050 


.105 


CI 


512 


64 


8 


.050 


.093 


.048 


.090 


.051 


.103 


.047 


.104 


.075 


.133 


C2 


512 


32 


16 


.047 


.104 


.044 


.087 


.039 


.085 


.053 


.109 


.068 


.124 


C3 


512 


16 


32 


.042 


.097 


.044 


.087 


.057 


.106 


.046 


.105 


.060 


.104 


C4 


512 


8 


64 


.050 


.102 


.053 


.101 


.052 


.088 


.058 


.121 


.062 


.114 


Dl 


1024 


128 


8 


.044 


.090 


.046 


.102 


.051 


.107 


.039 


.092 


.076 


.140 


D2 


1024 


64 


16 


.043 


.082 


.040 


.088 


.050 


.098 


.046 


.098 


.060 


.106 


D3 


1024 


32 


32 


.045 


.089 


.054 


.097 


.057 


.103 


.060 


.104 


.066 


.115 


D4 


1024 


16 


64 


.044 


.087 


.038 


.087 


.049 


.094 


.059 


.106 


.051 


.101 


D5 


1024 


8 


128 


.041 


.082 


.041 


.089 


.038 


.086 


.061 


.103 


.054 


.103 



Table 2: Rejection probabilities of the bootstrap test (4.12) under Hq for different choices of T,N 



and M. The data was generated according to model (5.1) with d = 0.2 and different values for <f>. 
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e = 


-0.9 


e = 


-0.5 


= 


= 


0.5 


e = 


0.9 




T 


N 


M 


5% 


10% 


5% 


10% 


5% 


10% 


5% 


10% 


5% 
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Al 
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16 


8 
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.116 
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.107 
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.085 


.044 
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.077 


.123 


A2 
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8 


16 


.068 


.122 


.054 


.112 


.059 


.125 


.073 


.117 


.070 


.133 


Bl 
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32 


8 


.045 
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.101 


.041 


.081 


.042 


.082 


.036 


.084 
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16 


16 
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.094 
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.104 
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256 


8 
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.064 


.123 


.057 


.113 


.049 


.101 


.042 


.092 


.061 


.130 


CI 


512 


64 


8 


.043 


.089 


.043 


.095 


.044 


.086 


.045 


.088 


.041 


.095 


C2 


512 


32 


16 


.046 


.109 


.067 


.112 


.052 


.093 


.051 


.096 


.043 


.086 


C3 


512 


16 


32 


.048 


.099 


.055 


.095 


.062 


.114 


.050 


.102 


.051 


.098 


C4 


512 


8 


64 


.038 


.097 


.055 


.100 


.047 


.100 


.046 


.093 


.042 


.092 


Dl 


1024 


128 


8 


.053 


.103 


.060 


.099 


.051 


.099 


.071 


.118 


.044 


.094 


D2 


1024 


64 


16 


.044 


.100 


.062 


.124 


.048 


.090 


.068 


.119 


.042 


.093 


D3 


1024 


32 


32 


.053 


.107 


.064 


.116 


.044 


.082 


.045 


.094 


.043 


.098 


D4 


1024 


16 


64 


.044 


.096 


.038 


.084 


.042 


.093 


.045 


.087 


.042 


.087 


D5 


1024 


8 


128 


.049 


.109 


.042 


.085 


.054 


.109 


.048 


.083 


.042 


.096 



Table 3: Rejection probabilities of the bootstrap test (4.12) under Hq for different choices of T,N 



and M. The data was generated according to model (5.2) with d = 0.1 and different values for 
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5% 


10% 


5% 


10% 


5% 


10% 


5% 


10% 


Al 
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16 


8 


.068 


.112 


.060 


.103 


.030 


.081 


.060 


.108 


.053 


.111 


A2 


128 


8 


16 


.060 


.117 


.051 


.103 


.061 


.110 


.062 


.114 


.068 


.117 


Bl 


256 


32 


8 


.059 


.122 


.048 


.102 


.045 


.094 


.038 


.078 


.040 


.083 


B2 


256 


16 


16 


.053 


.109 


.041 


.095 


.047 


.093 


.040 


.080 


.048 


.091 


B3 


256 


8 


32 


.060 


.100 


.048 


.098 


.057 


.119 


.050 


.092 


.061 


.106 


CI 


512 


64 


8 


.059 


.110 


.064 


.122 


.052 


.099 


.056 


.099 


.055 


.101 


C2 


512 


32 


16 


.060 


.122 


.044 


.107 


.041 


.103 


.043 


.113 


.046 


.086 


C3 


512 


16 


32 


.061 


.116 


.056 


.122 


.049 


.089 


.046 


.088 


.052 


.099 


C4 


512 


8 


64 


.056 


.095 


.057 


.118 


.057 


.110 


.047 


.100 


.055 


.102 


Dl 


1024 


128 


8 


.063 


.125 


.054 


.102 


.039 


.086 


.044 


.101 


.051 


.098 


D2 


1024 


64 


16 


.051 


.109 


.061 


.112 


.047 


.107 


.056 


.106 


.047 


.100 


D3 


1024 


32 


32 


.055 


.092 


.057 


.111 


.048 


.095 


.057 


.106 


.047 


.119 


D4 


1024 


16 


64 


.065 


.124 


.061 


.116 


.043 


.092 


.048 


.087 


.049 


.098 


D5 


1024 


8 


128 


.059 


.116 


.052 


.095 


.049 


.093 


.035 


.075 


.050 


.115 



Table 4: Rejection probabilities of the bootstrap test (4.12) under Hq for different choices of T,N 



and M. The data was generated according to model (5.2) with d = 0.2 and different values for 0. 
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Figure 2: Power of the test (4- 12) and the Kolmogorov-Smirnov test of Preufi and Vetter (2012) 



at 5% level for the model (5.3) under the scenarios A1-D5 from Table^ 
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Figure 3: Power of the test (4-12) and the Kolmogorov-Smirnov test of Preufi and Vetter (2012) 



at 5% level for the model (5.4) under the scenarios A1-D5 from TableQ 
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Figure 4: Power of the test (4- 12) and the Kolmogorov-Smirnov test of Preufi and Vetter (2012) 



at 5% level for the model (5.5) under the scenarios A1-D5 from Table [7| 
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6 Appendix: technical details 

In the following, we will state two results which will be central for the proof of the statements in 
Sections 3 and 4. 



Theorem 6.1. If Assumption 2.1 is satisfied with doo < 1/4, the following statements are correct. 



a) 



E((i*i,T) F 2) t) T ) — (■Fl,T)-F 1 2,T + dN,T) T + Ct + jyl-4d 

where the vector C T £ JR 2 is of order 0(N 2 /T 2 + log(iV)/(M./V 1 ~ 4d < x >)). In particular, this 
term vanishes if the functions ipi{u) are independent of u for all I € -ZT. 



b) 



Cov((A,t, A,t) t ) = s T + o(r, dc 



where 



0(T, d 



O 



jyl-8doo] 



+ 



T3 



+ 



T 3 A {§<*»<|} 



arid Et is fie same as i/ie matrix S in ( 5.3) except that the integral f is replaced by Y^jLx- 
c) If doc < 1/8 ana 1 Zi , Z2 £ ^V"o wii/i 'l + ^2 > 3, i/ten we have 

cum(VfF 1>T ll,VfF 2tT ll) = ( T (l-^)(i-^)) 5 
where 1^ 6 iR'* denotes a vector containing merely ones (i = 1,2). 



It follows by the same arguments as given in Section 4 of Preufi and Vetter (2012), that there exist 



parameters ipi tP such that the bootstrap process X^ T defined in (4.7) can be represented as 

00 

X t,T = /Al,P Z t-h 



(6.1) 



1=0 



where Z\ are the innovations from part 2) of the bootstrap description. We now assume that the 
null hypothesis (3.5) holds, and consider the process 

00 



t,T,2 



(6.2) 



1=0 



where the coefficients Tpi = ipi(u) are the coefficients in (2.10). We then define Dj* 2 as Dj, in 
(2.18) whereby the random variables X^t are replaced by X^ T2 - The next theorem shows that the 
random variable Dj? can be approximated by D 2 ^ * 2 . 



Theorem 6.2. let a > be fixed and denote with At{u) the event where \d — d\ < a/4. If 

(6.3) 



Assumption 2.1 and the inequality 

\$l,P 

are satisfied, then 



-jl-ma*G!,«0 < g p 4 log(T) 3/2 v/ e A 



a) 
b) 



2,* 

T 



Var((L> T '* -D%* n )l 



T,2J L A T (a) 



0{v i ]og(T) 3 / 2 N*- 1+a T- 1 ' 2 ), 
= O (p 8 log(T) 3 log(iV) 2 iV max ( 8 ^ 1 ' ) +2Q r- 2 
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6.1 Proof of Theorem I6.lt 

Proof of part a): We define tj := tj - N/2 + 1, 4>i(u jtP ) := V>/(^rM, z a,b ■= Z a _ N/2+1+b and 



write similar to Dette et al. (2011b) 



1 M Lf J M LfJ 



J=lfc=l j = l fc= _ L iV-Lj 

I N I 

M LtJ W-1 oo 

2T^ 2-*> (2ttN) 2 ^ ^ 

j=l fc = p,g,r,s=0 i,m,n,o=0 

^tj+p,T,l^tj+q,T,m^tj+r,T,n^tj+s,T,o^~'l^tj iP—l^tj ,q—mZtj ,r—n^tj ,s—o\ 

En t + ^at t + An,t + -Bjv,t, 



where we use the notation 



M LfJ , N-l oo 



i(p-q+r-s)\ k 



N 



2T ^ ^ (2vrA^) 5 

j=l fc = L J p,<?,r,s=0 f,m,n,o=0 



. m Lf-J at-i oo 

^W.T = ^fE E (2-kN) 2 E E Muj)^m(Uj)lp n (Uj)ll)o(uj) 
j=l / ;= [ jv ~ 1 J p,g,r,s=0 Z,m,n,o=0 



.4 



JV,T 



—m^tj ,s— o\ i 
i JV I 

M LtJ N-l oo 

^E E 7^)2 E E ^ l(p - 9+r - S)Afc '^[^ >P -/^ >9 -^-,r-n^, )S -o] 
j=l ; g= [ JV 2 ~ 1 J p,g,r,s=0 i,m,n,o=0 

U.j,p) - ^l(Uj)^$m(Uj,q)$ n (Uj t r)$o(Uj,8) + ^l{Uj){^m{uj^ q ) - ^m(uj)^'4>n(uj,r)'4>o(uj,s) 



+ 



JV,T 



m LtJ at-1 oo 

^E E 7^)2 E E ^ i(p - 9+r - S)Afc '^[^, P -^t^-^,r-n^, S - ] 
j=l fc = [ JV ~ 1 J p,j,r,s=0 (,m,re,o=0 

+^t J +p,T,l^t J +q,T,m(^t J +r,T,n ~ i(%>))i(«j, S ) + %+p^l%+q,T,m^i j +T,T,n(^t j +s,T,o ~ Muj 

Note that 5jv,T corresponds to the error which occurs if the coefficients tpt,T,i are replaced by tpi(t/T) 
and that ^4j\tt contains the approximation error of ipi(t/T) through ipi(tj/T) with tj denoting the 
midpoint of the j-th block. The following four statements conclude the proof for IEfi 7 !^] 

M 

- h (6-1) 



.7 = 1 71 
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= 


An,T 


= 


Bn,t 


= 



M^i)^(S- <«•«> 

' ' (6-7) 



Proof of (6.4): Without loss of generality, we only consider the first summand in Eh T . Due to 
the independence of the random variables Zt, we obtain that only those terms contribute to the 
sum where the conditions 0<p = q + l — m < N — 1 and 0<r = s + n — o < N — 1 are satisfied, 
which implies the inequality max{|/~ — m\, \n — o\} < N — 1. Thus, the first term in E^ T can be 
expressed as 

m m 



^E E E UnMrniuMniuMo^e-^^^ 



2T ^ ^ (2irN) 2 ^ 

" A I v ' l,m,n,o=0 

1 ' \l-m\<N-l 



\n-o\<N-l 

x(N- \l-m\)(N- \n-o\) 

^ M 1 . oo 

= ^EE (2irN) 2 ^ ^(^•)^ m (u i )V'n(uj)V'o(ui)(A r -|/-m|)(iV- |n-o|), 

j=l /i=— 1 l,m,n,o=0 
\l-m\<N-l 
\n-o\<N-l 
l—m+n—o=hN 

where we used the well known identity 
N-l 



E exp(-iA fci Arr) 



fc=0 



iV if r = iV/i for some h £ /Z 
else, 



[note that we only have to consider three possible values of h since max{|/ — m\, \ n — o\} < N — 1]. 
It is easy to see that 

E NT = En T0 + E]^ Tl + E\j T2 + E\f T3i 

where 

- M oo M „ v 

E N,T,0 = ^E 7^2 E Muj)^m(Uj)lpn(Uj)Muj) = ^^^2 f 2 (Uj,X)d\, 
j=l ^ ' l,m,n,o=0 j=l 77 

I— m+n— o=0 

j Af 1 oo 

E N,T,1 = ^E E E ^l{ u o)^rniuj)^ n {Uj)^ (uj) (6.9) 

j'=l ?t=— 1 l,m,n,o=0 
\l-m\<N-l 
\n-o\<N-l 
Z— m+n— o=hN 



X ( — iV|Z — m| — iV|n — o| + |/ — m\\n — o|), 



^ M oo 

E N,T,2 = ^E E E Muj^miUj^niu^tpoiUj), (6.10) 



j=lft.e{-l,l} l,m,n,o=0 
\l-m\<N-l 
\n-o\<N-l 
l—m+n—o=hN 
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^ M oo 

E N,T,3 = MuMmiUj^niUj^oiUj). 



(6.11) 



j=l l,m,n,o=0 
N<\l-m\ 
N<\n-o\ 
I— m+n— o=0 



In order to complete the proof of (|6.4l), it therefore suffices to demonstrate that the last three 



expressions are of order 0( Nl } 4dao ) . We commence with (6.9). Setting o = I — m + n — hN > 

l-m- hN\ 



and using (2.4), it follows that there exists a constant C G M such that 

Ml oo 



E 



l 

iV,T,l 



< 



C 



EE E 



1 



1 



1 



MN ^ ^ ^ p--doo m, 1_d °° n 1 ~ do ° (l — m + n — /iiV) 1_d ° 

j=l 7i=-l l,m,n=l v ' 

l<|i-m-/iJV|<JV-l 
l<Z-m+n-/i7V 



(6.12) 



(note that all terms where one of the variables l,m,n or l — m + n — hN vanishes are of smaller 
or the same order). This argument will be employed continuously throughout this proof without 
mentioning it explicitly. Note that the summand \l— m\ does not occur in the numerator of the above 



expression due to the symmetry of \l — m\ and In — o| in (6.9), and that C G M + denotes a universal 



constant throughout the whole proof. Setting z := 1 — m — hN, we obtain \z\ = \l — m — hN\ < N — l 



and the expression on the right hand side of (6.12) can be written as 

1 oo ., , 



c_ 

N 



E E E 



(z + m + hN) 

h=-l ze% m,n=l y ' 

1<|*|<JV-1 \<z+m+hN 
l<n+z 



1 



1 doc' fft^- doc doc _|_ ^gAl do 



i 



7.2\ i 1 \ v \2d ao 

JV z — ' ' \z + TiiVl 1-2 ^ 00 ~ 7Y 1_2d °° ' z — ' b + /tiVl 1_2dco ~ N 1 - 4 ^ ' 

/i=-i ze^ 1 1 /i=-i ze^ 1 1 

1<|«|<JV-1 i<k|<JV-i 



where a n < 6 n means that a n /b n is bounded by some finite constant for all n G IN. By using (2.4), 



(6.8), (7.2) and similar arguments, we obtain that (6.10) is bounded by 



E E 

hef-i.l} ze% 

l<\z\<N-l 



1 



\z + hN\ 



1 2f/<3Q I ry I 1 2t/<3 



jyl-4doo 



and since (6.11) is shown analogously, we therefore conclude the proof of (6.4) 



Proof of (6.5): The result follows by similar arguments as used in the treatment of (6.9)-(6.11) 



Proof of (6.6): Without loss of generality, we only consider the first summand and replace 



ipm(uj t g)^ n (uj tr )jp (uj tS ) by ip m (uj)ip n (uj)ip (uj) [the error due to this replacement is negligible, 



(2) 

which follows by analogous arguments as given for the term A NTl at a later stage of this proof]. 
Due to the independence of the random variables Zt, we obtain the sum of three terms [compare 
the definition of Ej^ T for the first two summands and the definition of E^ T for the third one] and 
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we restrict ourselves to the first one, i.e. we only consider 

JV-l 

V V 1 

2T 



A 



N.T.I 



, M LfJ 

rr XI zE 



E E 



-i(p-g+r-s)A fc 



(2^iV) 2 

j=1 fc = [ JV ~ 1 J p,q,r,s=0 l,m,n,o=0 



Ei[Zt j ^iZ tjt q- m ]B[Zt j} r-nZt j ,8-o][i>l{v>j#) - r ipi{u j ))ijj m {uj)%lj n (uj)i) {u j ). 
Using a Taylor expansion, we can write 



N/2 + l + p^ t ^"( md>p ) f -N/2 + l+p\< 



T 



+ 



T 



with r)ij >p £ (uj — N/(2T),Uj + N/(2T)), and therefore An,t,i splits into two terms which will be 
denoted by A^ T x and A^ T 1 in the following discussion. We start with the treatment of the first 
summand. Employing the independence of the innovations we obtain that the indices corresponding 



to non vanishing terms must satisfy q = p + m — I and n = o + r — s. Applying (|6.8|) afterwards 

,0,1} 

log(f) 



yields < m = / + r — s — hN with h G {—1, 0, 1} and this combined with (2.4) and (2.6) implies 
^ 1 JV-l oo 
A N,T,1 ~ J72 XI XI XI 



j\[2 z^ Z^ Z^ Z 1_d °° (l + r — s — 7iiV) 1_d °° (o + r — s) 1 ~ d ° a o 1_d °° 

l<o+r— s 
l<Z+r-s-/iJV 
0<|r-s-WV|<JV-l 



JV-l 

E 

p=0 

0<p+r-s-/iJV<JV-l 



-N/2 + 1 +p 
T 



We restrict ourselves to the cases |r — s\ > 1 and |r — s — hN\ > 1 since the remaining terms are 
of smaller order. A straightforward calculation yields 

JV-l 



E 



p=0 
0<p+<?<JV-l 

N 



-N/2 + l+p 
T 



2T 



x 1 



{9=0} 



+ min 



N\q\ (N 



T 



T 



X l{l<|g|<JV-l} 



(6.13) 
(6.14) 



and by using the second summand it follows that A N T 1 is bounded by 

1 JV-l 



c 

NT 



V V V ^ 1 1 —\r-s-hN\ 

Z—^ Z- / Z- / [l — d-a ^ _|_ ^ _ g _ /jj\T)l — rfoo ( _|_ j> _ s)l~ ^oo qI — rfoo 1 1 



Ji=-lr,s=0 



Z,o=l 
l<o+r— s 
l<l+r-s-hN 
l<\r-s-hN\<N-l 



< 



log(iV) 



JV-l oo 



1 



1 



t ti (o+r-sy 



< 



log(iV) 



^i-2d . -p XI XI ( r) _)_ r _ .sji-'/.v (ji-'-'-v •-- /\- j -•:></, 



l<o+r— s 



JV-l 

E XI |r — .s 

r,s=0 



1 



l-2d„ 



< 



log(iV) 



A/jV 



l-4rf„ 



where we used Lemma |7.1b ) and |7.1b ) for the first and second inequality, respectively. Next, we 
show that Ajy T 1 is of order 0(N 2 T~ 2 ), and for this reason we choose e > such that 1— 4^— e > 0. 
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Using ( |2.4| ), (2.6 ), (2.7) and h E {— 1,0,1}, the claim then follows by a further application of Lemma 

E30 



iV2 



log 2 (0 



AT-1 oo 

;i-rfoo + r _ s _ hN) 1 '^ (o + r - s) 1_rf °° o 1-d » 

r,s=0 Z,o=l 

l<o+r— s 
KJ+r-s-MV 



AT-1 

E 

0<p+r-s-MV<A/-l 



-7V/2 + 1 +p\2 



r 



AT-1 

< _ 

r,s=0 i,o=l 

l<o+r— s 
l<Z+r-s-WV 



AT-1 oo | 



" e (/ + r - s - hN) l ~ d °° (o + r - s) l - d °° o 1_do ° 



N N ^ 

< - E 



i 



T 2 * — ' \r 

r,s=0 



< 



s _ / l jV|l-2d 00 -e U _ s |l-2d tXJ ~ ^2 " 



Proof of (6.7): The statement follows from (2.3) and similar arguments as given in the proofs of 



(6.4) and (6.6). 



In order to proof the assertion for i*2T> ° ne proceeds in the same way and the details are omitted 
for the sake of brevity. However it turns out that the expression corresponding to T does not 
vanish in this case and there appears an additional bias, which will be denoted by djvy. 

Proof of part b) We restrict ourselves to the proof of 



m L£l 



Var(iV) = i Var( - ^ £ I N {u 3 Ak. 



AT 



5 1 
ttTM 



M 



i=i 



^ / f 4 ( Uj ,X)d\ + 0(T,d c 



and recall the definition of the remainder 



/ \os(N) \ /N 2 N 2+4d °° 



»)■ 



All other statements can be verified completely analogously and the details are omitted. By com- 



bining the arguments from the proof of part a) and from Dette et al. (2011b), we obtain that 



M Lf 



Var (^zC £ iNiu^Xk^)' 2 ) =32rV) + 8rV') + OU\r/. x ). 



where 



V*(u) 



M Lf. 1 



T2 Z_3 Z_3 



1 



N-l 



N-l 



E E E E 



T 2 ^ ^ (2vriV) 4 

jij2=l fci,fc 2 =-L JV ~ 1 J Pi,9i,ri,si=0p2,g2,T , 2,«a=0ui ) Mi,a;i,j/i=0v2,«'2,a!2,s/2=0 
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(«n )Vv )^i )ij>vi ( u h )^2 {u h )ip W2 (u h )ip X2 (u h )ip y2 (u h ) 

e -j(pi-gi+ri-si)A fcle -i(p2-92+»*2-S2)A fc2 



12 ,T2-X2 Ztj 2 ,S2-?/2] 



and 



1 a/ LfJ 



V> ) 



1 



N-l 



N-l 



E E E E 



T 2 ^ ^ (2vriV) 4 

jiJ2=l fc 1 ,fc 2 =-[ JV ~ 1 J Pl,9l,' , i,si=0p2,?2,» , 2,S2=0fi,w)i,a;i.j/i=0t)2,W2,a;2,?/2=0 

^2 i U j 2 )' l l ; W2 ( U j 2 )'4 , X2 ( U j2 ) VV2 ( n j2 ) 



IE ,gi - wi ^ 2 ,g 2 - w 2 ] E [^tjj ,ai -2/1 ^ j 2 ,S2-lto J 



We start with V*{u). Because of the independence of the random variables Zt, the restrictions 
Pi = qi+vi-wi, p 2 = ri+v 2 -xi + (ji -fe)N, q 2 = s\+w 2 -yi + (ji - j 2 )N and s 2 = r 2 +y 2 -x 2 
are necessary for a non vanishing term. Consider h\,h 2 6 { — 1, 0, 1} and sum over k\, k 2 by using 
(6.8). Then, V*(u ) can be written as 

, M Ifj 

">'> - ^ E E 



(2ttA0 4 



N-l 

E 



E 



E 



ji.J2=l fci fc 2 =-l JV ~ 1 I V ' ' 9i,^i,si,r- 2 =0 v 1 ,w 1 ,x 1 ,y 1 =0 v 2 ,w 2 ,x 2 ,y2=0 

2 0<qi+vi-vn<N-l 0<r 1 +v 2 -x 1 + (j 1 -j 2 )N<N-l 

0<si+w 2 ~yi+\ji-j2)N<N-l 
0<r 2 +y2-X2<N-l 

V2 \ u j2 (^2)^2(^2) 
g — Wl+Tl — SlJAfcj g — i(n+t&— «!— 51— 1U2+J/1— J/2+!C2)Ak 2 



. M 1 

E E 



1 



N-l 



M 2 



il,j2 = l /ll,/l2= 



E E 



E 



ri,si=0 «i,«/i,a;i,j/i=0 l>2!W 2 ,a; 2 ,2/2=0 

i>i— toi+n — si=ftiJV ri+ti2— xi— si— ii)2+J/i— y2+x 2 =h 2 N 

o<|i;i-wi|<jv-i 0<n+v2-xi+(ji-j2)N<N-i 

0<s 1 +W2-yi+\ji-j2)N<N-l 
0<\y 2 -x 2 \<N-l 

(u h )ip m (u h )if> Xl (u h )i> yi (u h )il> V2 (u j2 )ip W2 (u j2 )ifj X2 (u j2 )ip y2 (u j2 ) 
(N 2 — iV|y2 — x 2 \ — N\vi — wi\ + \vi — wi\y 2 — x 2 \). 



An application of (2.4) yields similar to the proof of part a) that the above expression is of order 
O(jvi-iLooT) j if 



(i) hi,h 2 € { — 1,1} [compare (6.10)], 



(ii) j\ 7^ j 2 [we prove this claim in Lemma 7.2 in the appendix since this kind of restriction did 
not occur in the proof of part a)] , 



(hi) we drop —N\y 2 — x 2 \ — N\v± — wi\ + \v\ — w\\y 2 — x 2 \ [compare (6.9)], 



(iv) we drop < \v% — wi\ < N — 1 and < \y 2 — x 2 \ < N — 1 [compare (6.11)], 
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(v) we drop < r\ + V2 — x\ + (ji — j2)N < N — 1 and < s\ + W2 — yi + (ji — 32)N < N — 1 



[compare (6.9)]. 

By rearranging the equation v% — w\ + ri — s% = to < si = ri + vi — < N — 1, it follows 

^ A/ N-l oo oo 

ii = l ri=0 ui,u>i,2i,yi=0 i>2,ia>2, 22,1/2=0 

0<ri+ui— Wi<N— 1 u>i — f i +t> 2— 21—102+1/1 — 1/2+22=0 

t/» Wi (uj^ip^ {u h )i) Xl (it/JVyi {u h )ip V2 (u h )ij} W2 {u h )ij) X2 (u h )il> V2 (u h ) + 0(T,d 00 ) 



— y— y 

M 2 N ^ (2vr) 4 ^ 

j 1= l v 1 ,w 1 ,x 1 ,y 1 =0 



E 



V 2,H>2,22, J/2=0 

wi~vi+v 2 -W2+yi—xi+X2—y2=0 



if) Vl (u h )ip wi {u h )il) xi {u h )ii) yi {u h )ip V2 {u h )il) W2 {u h )ip X2 {u h )i> V2 (u h ) + 0(T,d 00 ) 



1 1 

TM^k 



M r 

/ f\u h , A) d\ + 0(T,d oc ). 

h=l J - n 



By using the same techniques as in V*{v ), we obtain 
- M 1 . JV-1 

^ (" ) = TJ2 S Z S 



oo oo 

M 2 ^ ^ (27rA0 4 Z 2 

jl=l/ll=-l g2,l'2,S2=0 Vi ,W\,X\ ,J/l=0 U2,l«2 ,^2 ,2/2=0 

0<92— r2+S2+'lliV'<Ar— 1 Dl—D2+l«2 — l«l+2l— ^2+1/2— 1/1=0 

i>vi (uj t )ip wi {u h )^ xi {u h )ip yi {u h )ij) V2 {u h )i} W2 {u h )i) X2 {u h )i> y2 {u h ) + 0{T,d oo ). 

Note that, in contrast to the term V*(v ), the cases where h\ £ {—1,1} do not vanish. In fact, 
using the three equalities 

N-l 



N-l 

E 

g2,>"2,S2=0 
0<92-r'2+S2<A r -l 

N-l 



*N 3 + 0(N 2 ), Yl =In 3 + 0(N 2 " 

Q2,1 - 2,S 2 =0 
0<q 2 -r 2 +S2+N<N-l 



6 



E 



1 



iV 3 + 0(N 2 ) 



we deduce 
V*(u ) 



Q2,r2,S2=0 
0<q2-r 2 +S2-N<N-l 



2 1 1\ 1 

+ 77 + 



3 6 6/ M 2 iV ^ (2vr) 
ji=i 



M j oo 



E 



Hi ,U>1 ,21,1/1=0 1>2,1«2, 22 ,1/2=0 

Dl— Wl+2l -22+J/2—l/l=0 



tp vi (u h )i/j m (u h )4> Xl {u h )i) yi (u h )i/j V2 (u h )ip W2 {u h )ifj X2 (u h )ip y2 {u h ) + 0(T,doo) 
M 



1 1 

TM2tt 



J] / f\u J1 ,X)d\ + 0(T,d O0 ). 



(6.15) 

ii=i' , - ,r 

Proof of part c) Exemplarily we consider the case h = and I := l\ > 3. The other cases can be 
treated similarly with an additional amount of notation. Following the same lines as in the proof of 



Theorem 3.1 in Dette et al. (2011a), it is sufficient to choose an arbitrary indecomposable partition 



^(•^itjjjOi— vi^ti 2 ,a2— wi)i {Zti 3 ,a3—xiZti 4 ,a4,—yi)i (^t^ ,a U -i-xi ^ ,au-Vi ) } 
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of the table 



^tj 1 ,pi-g 1 Ztj^,q\—m\ ^tj x ,r\—n\ Ztj x ,s\—o\ 



^tj v Pl-9l \,9rmi ^tj v n-ni ^tj v si-oi 



sec 



Bollinger (1981)] and to treat the term 



M 



LfJ 



JV-l 



E E E E 



E 



(6.16) 



T2 1 

j!,...,jl=l fc lv ,,,fc;=-L JV J 1 J Pl,-,si=0 v 1 ,w 1 ,x 1 ,y 1 =0 vi,wi,xi,yi=0 
ifrvi { U jl,pi ( U jl ,91 ( U il,?"l ) Vfyl ( U jl ,Sl )^2 { U 32,P2 )' l l ; W2 ( U j2,q2 ) V^2 \ U h,r2 ) % l ) V2 ( U j2,S2 ) 



IE [Z't il ( ai _ Vl 2^ j 02— wj^l^tig, 03— Ei^t^ ,04—1/1] IE [Zt i5t a 5 -V2^ti 6 ,a 6 -w 2 ]'^ l [^ti T ,a7-X2^ti s , as-JteJ 
^[^ti4j_3,a4!-3-^^i 4i _ 2 ia4!-2-wJ^[^*i 4! _ 1 ,a4!-l-2!^*i 4 pa 4 ;-y i ] 

with{ai,a 2 . . . ,a 4i } G {pi, . . -,pi,qi, ...,qi,n,.. . ,r h s\, . . .,si}, a* / Oj fori ^ j, {ii,i 2 , • • • ,«4«} G 
{iiij2i • • -iz}, and \{ii,i 2 , ■ ■ ■ ,Ui}\ = I- We now discuss the conditions which yield a contribution 
different from in this sum. Note that some of the ik are equal to each other and we will therefore 
write ji, ...fji for the I different values and consider as a function depending on ji, Using 
the independence of the random variables Z-t and summing with respect to k\ , . . . , ki , the conditions 

O-Am+l ~ «4m+2 + »m+l ~ »m+l + (Um+1 ~ Hm+2)N = for 771 = 0, . . . , I - 1, (6.17) 
a4m+3 - 04m+4 + Um+1 ~ X m +1 + (i4m+3 ~ lAm+ijN = for 171 = 0, . . . , I — 1, (6.18) 

Pi — Qi + Ti — Si = hiN for i = 1, 2, . . . , I and /i, G { — 1, 0, 1} (6.19) 



follow. Rearranging the equations in (6.19) for a variable and plugging them into the I equations 



(6.17) (where in every equation only one variable is replaced) yields, due to the indecomposability 



of the partition and v m+ i, x m+ i > 0, that the conditions 

(1) < vi = ai - a 2 + a 3 - + wi + (i\ - i 2 + h{)N, 

(2) < v 2 = a 7 - a 8 + a 9 - Si + w 2 + (is — h + h 2 )N, 



(I) 0<Vi= 6 /_ 5 - a 6 ;_ 4 + a 6 ;_ 3 - 6 /_ 2 + W[ + (i 4i _ 3 - i 4i _ 2 + h{)N, 

(l + l) < x\ = a 5 - o 6 + yi + (i 3 - u)iV, 
(Z + 2) < x 2 = an - a i2 + y 2 + (i 7 - i 8 )iV, 



(6.20) 



(2Z) < xi = a 6/ _i - a 6i + y t + (z^— l - ^)N 
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must hold, where {01,62, • • • ,aei} G {pi, . .. ,s/} and |{ai,<22, • • • , a6z}| = 3/. By employing (2.4) 
we can bound the above expression up to a constant by 



I M 



1 



JV-1 



E E E 



1 T5 

ji,—,tt =1 h\,...,hi=—l ai,...,a 6 i=0 

00 00 

E - E 

m,yi=l w t ,yi=i 

a 1 -a2+az-a4,+w\ + {ii-i2+hi)N>l a & i^-au-i+au-'i-a-u-2+wi + {iu~i-iil-2+hi)N>l 

as— a 6 +yi+[i3— U)N>1 a m _ 1 -a 6l +y t +(i 4l _ 1 -i 4l )N>l 

1 111 



(ai - a 2 + 63 - 04 + wi + (n - «2 + hi)iV) 

1 



1— doo ...1— do 



w 



(as - a 6 + yi + («3 - u)iV) 
1 



1— doo -.1 



2/1 



(07 - d 8 + a g - aio + w 2 + (*5 - *6 + h 2 )N) 



1 



(an - 012 + y2 + (»7 - is)N) 



l C^OO yj.l ^OO 



(a 6Z _ 5 - a 6 i_ 4 + a 6 ;_ 3 - a 6Z _ 2 + i«j + (%_ 3 - *4Z-2 + M^O 1 d °° d °° 

1 1 
(d 6 /_i - a 6i + yi + (i4i-i - hO^O 1 ^ 00 ^ 1_do ° ' 

Using Lemma |7.1b ) in the appendix, this term can be (up to a constant) bounded by 



1 T5 

Wn 21 



L M 



E E 



N-l 

E 



N-l 

E 



Jii— ii! =1 hi,...,hi=— 1 01,62,... ,O6=0 a 6 i_5,a 6; _4,...,a 6 i=0 

|ai-02+a3-a4 + («l-j2+'ll)A r |>l 2el-4+S6J-3— ^J-2+(Ul-3— ^I-S+M-NI^ 1 

|a 5 -a6 + fe-M)N|>l I a 6z „i -a 6i + (i 4! _ L -i 4J ) > 1 

1 1 



|ai - a 2 + a 3 - a 4 + (ii - z 2 + /j-i)iV| 1 2d °° |a 5 - a 6 + (i 3 - z 4 )A^| 1 2d °° 

1 1 
|a 7 - a 8 + a 9 - aio + («5 - h + h 2 )N\ 1 - 2d °° \a n - a 12 + (i 7 - z 8 )iV| 1—2^00 

1 1 

|a 6/ _ 5 - a 6Z _ 4 + a 6i _ 3 - a 6i _ 2 + (242-3 ~ H1-2 + hi)N\ l - 2d °° |a 6i _i - a 6Z + (i 4z _i - i^JiV] 1 - 2 ^ 

We now assume without loss of generality that 

06m+l — «6m+2 + «6m+3 _ a6m+4 + (Um+1 ~ Um+2 + h m+ i)N > 1, 
06m+5 — fl6m+6 + {Um+3 ~ Hm+'ijN > 1 



(6.21) 



holds for m = 0,1, 2, ... ,1 — 1 (the more general case follows analogously with an additional amount 
of notation). In this case the absolute values in the above expression can be skipped. It follows, 
as in Dette et al. (2011a), that the conditions on the 5$ imply that, if i\ is chosen, there are only 



finitely many possible choices for k = 2, I. Thus it suffices to consider the following sum 



1 T2 

M l ~ x N 21 



N-l 

E 



N-l 

E 



Oi,O2,...,66=0 a 6 ;_5,a 6 i_4,...,a 6; =0 
0i-a2+03-04+Ci7V>l a 6i _ 5 -a 6 i_4+a 6i _3-a 6i _2+CiW>l 
a 5 -a 6 +Ci +1 N>l a, 6 i_ 1 -d 6l +C 2 iN>l 
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(ai - a 2 + a 3 - a 4 + CiiV) 1 " 2 ^ (a s - a 6 + Cz+iiV) 1 " 2 *" 

1 1 
(a 7 - a 8 + a 9 - 5 10 + C 2 N) 1 - 2d °° (on - a 42 + C^AO 1 " 2 ^ 

1 1 

(a w _ 5 - a 6 z-4 + m- 3 - a 6 i- 2 + QN) 1 - 2 ^ (5 6 /-i - a 6 j + C 2 iN) 1 - 2d ^ 

with Ci, C 2 , . . . , Q € {-1, . . . , M} and C, +lj Q +2 , . . . , C 2 / € {0, 1 . . . , M - 1} (because of flOl ) 
and 5j € {0, 1,2, . . . ,N — 1} there are no other possible values for Cf). We remind that (due to 
the indecomposability of the partition) the 2/-fractions inside the addend are hooked. This means 
that for two different fractions there exists a chain of fractions (starting with the first considered 
fraction and ending with the second one) , such that in every element of the chain there exists at 
least one element d{ which also occurs in the consecutive fraction. We will perform a summation 
in a particular way and in order to illustrate this, we consider the first two fractions and assume 
that di and clq are (up to a the algebraic sign) the same. We distinguish two cases. 

(i) If 5i = &6, we obtain from Lemma |7.1b .) that 

N-l 1 1 

E 

oi =0 

Si — 0,2+0,3— S4+C1 JV>1 
a 5 -a a +C l+1 N>l 



(Si - d 2 + a 3 - a 4 + dN) 1 - 2 ^ (a 5 - a 6 + Q+iiV) 1 - 2 ^ 



< 



1 



< 



-02 + a 3 - a 4 + a 5 + (Ci + Ci+i)J\0 1 - 4 *» 
(-02 + a 3 - a 4 + a 5 + (Ci + Q+iJiV) 1 - 2 ^ 



(6.22) 
(6.23) 



Furthermore we have — £2 + 53 — 04 +05 + (C\+Ci+i)N > 2 which follows from the conditions 
ai — a 2 + a 3 — a 4 + C\N > 1 and S5 — a§ + Ci + \N = 0,5 — 01 + Ci + \N > 1. 

(ii) If 5i = — 06 and — a 2 + a 3 — a 4 — as + (Ci — Ci+i)N 7^ 0, it follows from Lemma [7TT| o) that 
JV-l 

E 

61=0 

0,1 — 0,2+0,3— a,4+CiN>l 
a 5 -a a +C l+1 N>l 



< 



< 



(01 - a 2 + a 3 - a 4 + CiiY) 1 " 2 ^ (a 5 - a 6 + Ci +1 Ny- 2d °° 



1 



a 2 + a 3 - a 4 - a 5 + (Ci - C^iVf- 4 ^ 



(6.24) 
(6.25) 



a 2 + a 3 - a 4 - a 5 + (d - C I+ i)tf I 1 " 2 *- ' 
In both cases, it is possible that variables cancel out, for example if a 4 = as and a 3 = as in the 



first and second case, respectively. We apply ( 6.22 )-( 6.25) in total 21 — 2-times. In the first 21 — 4- 
applications, we use (6.23 ) and ( 6.25 ) (depending on the algebraic sign of the variable which appears 
in both fractions) and in the (21 — 3)th and (21 — 2)th application we employ (6.22) and (6.24). 
We furthermore assume that h variables cancel out while utilizing these inequalities. Afterwards 
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3/ — (21 — 2) — h = I + 2 — h variables remain with < h < I, namely aw— i, a%\ and / — h other 
variables with values in {0, 1, 2, . . . , N — 1}. Denoting these I — h variables with hi, hi, • • • ? ^i-h we 
obtain 

1 rk N ~ l N - x 

E ■■■ E 



ai,a2,---i"6=Q a6I-5! a _6!-4v i a6!=0 
ai-a 2 +a 3 -a 4 +CiA'">l a 6; _5-a 6i _4+a6!-3-a6l-2+C , iiV>l 
a5-a6+C;+iW>l a6;-l-a6i+C2iN>l 

1 1 

(ai - 5 2 + a 3 - a 4 + CiAO 1 " 2 ^ (a 5 - a 6 + Q+iiV) 1 " 2 ^ 
1 1 

(a 7 - 5 8 + a 9 - aio + C 2 N) 1 - 2d °° (a n ~ ~a\2 + C/^iV) 1 - 2 ^ 
1 1 



1 T5 JV " 1 N ~ X 

1 ' E E 

16!-Iia6!=0 Oi,O2,...,b(_h=0 
a6 ; -l-a6 i +C 2i Af>li<|a 6i _ 1 _a 6i +^- h 1 (-l) fe J26 J +E, 2 L^ 



(a 6/ _ 5 - a 6Z _ 4 + a 6 /_ 3 - a 6/ _ 2 + QN) 1 2d °° (d 6/ _i - a 6i + C^/N) 1 2do 

1 T2 

a 6i-li a 6i- 

N h T (2l-A)2d oc | 

|a 6i _i - a 6 « + E5=i(-l) fcj '2bj + Ef=i (-^CjN^-^ 1 «6Z + ChW) 1 - 2 ^ 

with some % E {0, 1}. We first consider the case h = I. If Y?j=i (- l ) k] c i N = and C 2l = 0, it 
follows that the above term equals 

i t In 1 t( 21 -^ ^ i < i jjjg+irg-^ _ r (i-4)(i- 8doo) 



M 1 - 1 N 21 . 4^ (oej-i - a6«) 2_Mtxl ~ M/_1 

a 6!-l;^6!— 



If | Ej=i ( — l) fej ' C;-?V"| > 1 or C21 = 1, we apply Lemma 7.1 1) and b) in order to obtain the same 



upper bound (it can be shown that, in this case, there appears an additional factor iV 1 8d °° in the 
denominator, so the corresponding term is, in fact, of smaller order). The same upper bound holds 
for h < I - 1. □ 

6.2 P roof of Remark l3~6t 

If we replace F± : t by the corresponding integrated version F± : t = 

YLT=iZjN^\) 2 d\ the 

derivation of the asymptotic variance can be carried out almost analogously as in the proof of 



Theorem 6.1a) except that the term, where the variable hi in V*(v ) equals —1 or 1, does not 



occur, because for the integrated version one can use 



If* 1 if r = 0, 

— / esxp(-i\r)dX = < (6.26) 
2?r J-n else, 



(for r G 2Z) instead of (6.8). Therefore, in the integrated case, we obtain 



V ^ ^ 3M 2 iV £ (4vr) 2 (2vr) 2 ^ £ 

jl=l Wl, 101, Xl, 2/1=0 V2,W 2 ,X2, 2/2=0 



fi-f 2 +u>2— M)i+a;i-Z2+2/2 -2/1=0 
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Oil )Vv (uji )il>vi (ujMv 2 (ujMw 2 {u h )ip X2 {u h )ip y2 [u h ) + 0(T,d O0 ) 



2 1 1 



M w 

J2 / fHu n ,X)dX + 0(T,d OQ ) 

31=1 J -* 



instead of (6.15) and we recall that the order 0(T, d^) is defined in (6.1). This yields that the 
asymptotic variance of y/TF^T is |^ f 4 (u, \)dud\ and does not coincide with the asymptotic 

variance of s/TF\ t- 



6.3 Proof of Theorem I6T21 

Proof of part a): We define F* T and F* T2 as Fi t where the observed data Xt t T are replaced 



by X^ T and X^ T2 , respectively. By using (6.1) and writing I^(u, A) for the bootstrap analogue of 
In(u, A), we then get 



1 M L ^ J 

E ((^T,T - F l,T,2) l A T ( a )\ X l,T, ...,X TiT ) = — ^ } y 



1 



N-l 



j'=ife=-L- J 



E E « 

p,q,r,s=0 l,m,n,o=0 



,-i(p-q+r-a)\ k ,N 



i , l,m,n,o,p^A T (a)^[Zt jl p-lZf jtq ^ m Z^ r _ n Z^ s _ \ 

[compare the first set of equalities in the proof of Theorem 6.1 a)], where ipi t rn jn ,o,p = i J i,p'4>m,p' l l ) n,p' t Po,p- 
ipl^mtpntpo- By using the decomposition 

i>l,m,n,o,p = W,p - ^l)$m,p$n,p$o,p + ^lWm,p - i>m)$n,p$o,p 
+1pl1pm(^n,p ~ 4>n)^o,p + ^l^m^n(^o,p ~ i>o) 

the above expression splits into four terms and for the sake of brevity we only consider the first 
one. The other cases are treated similarly. As in the proof of Theorem |6.1| a) we then obtain terms 
E^* T and E 2 ^* T which are defined as Ej^ T , E 2 NT where the coefficients ipi(uj)ip m (uj)i(^ m (uj)ip (uj) 
are replaced by (i^i >p — ^i^m^n^o^ [note that A* NT = B* NT = since the coefficients of the 



bootstrap process do not possess any time dependence]. If we employ (6.3) and combine it with 
the fact that \d — d\ < a/4 on the set Ax(ct), we get 



which together with (2.4) and the assumptions of the theorem implies 

\i>l, P \ < C\l\ a /^~ l Vl,p£lN. 



(6.27) 



(6.28) 



Note that the coefficients in the MA(oo) representation of the bootstrap processes do not depend 
on time and that for such processes we only required 



sup |^(«)l <C\l\ 



d-l 



(6.29) 
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in the proof of Theorem |6.1| a) to obtain appropriate bounds for the error. By using (6.27) and 



(6.28) instead of (6.29) and similar arguments as given in the proof of Theorem 6.1 a) it follows 



that 



H(Kt - n T ji AT(a) \x hT , ...,x T>T ) = f*>- + o(^- 1 +Viog(r) 3 / 2 r- 1 /2) ) 

where is defined as F^t but with f(u, A) replaced by 



2tt 



^2 i>l,m,n,o,p exp(-zA(Z -m + n-o)) x l Ar(a ). 

l,m,n,o=— oo 



Since F^ T and F^ T 2 are treated analogously, the claim follows [note that F^' T cancels out since 
the coefficients do not possess any time dependence]. □ 

Proof of part b): The assertion follows by similar arguments as given in the proof of Theorem 



6. 1| b) employing (6.27) and (6.28) instead of (6.29) as above. The details are omitted for the sake 
of brevity. □ 



6.4 Proofs of the results in Section 3 and 4: 



Proof of Theorem 3.2 : The claim follows by employing the Cramer- Wold device in combination 
with Theorem 16.11 □ 



Proof of Theorem 3.4\ Similarly to the proof of Theorem 6.1 the two equations 



7 = 1 J ^ 7T 



Var (jl) = O 



1 



can be established. By Markov's inequality the assertion of the theorem follows. 



□ 



Proof of Theorem 



4.2 



We define D^*a as ^t*2 an< ^ a as but with X^t replaced by 
Xt(t/T) from (2.10). Then part a) is obvious, because we have ipi = ipi{u) for all u G [0, 1] under 
the null hypothesis and Zt and Z\ are both independent and standard normal distributed. Part 



s2,* 



b) follows from the proof of Theorem 6.1, so we focus on part c) and d). Note that (2.11) and 



Theorem 6.1 a), b) imply 

CiiV max(8d-i,o) /r < Yss{D^ a ) /T < c 2 (N max ^- 1 '^ + log(iV)l { d = i/ 8} )/T 
which directly yields part d). If we have 



P(A T (a)) ->• 1 as T -> oo 



(6.30) 
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for every a > 0, Part c) follows from Theorem 6.2 (4.8), the conditions on the rate of p(T) if a is 
chosen sufficiently small. Finally, (6.30) is a consequence of Lemma 4.3 in Preufi and Vetter (2012). 
□ 



Proof of Theorem 4.3^ By employing the triangle inequality we can bound the Mallow metric 
between D%/Jvax(D%) and D%* /Jvai(D%*) by 



d 2 ( i%/JVax(i%),mjJVax(D*) )+d 2 [ r%JJVBx(D*J,i%*/JVax(D2* 



A2,* 



+d2 ( D^yJVzr(Dl*),D z //J\ & r(Dl 



2,* \ A2,* 



S2,*- 



[where D 2 -, a and Dy* are the random variables from Theorem 4.2 specified in the proof of which] 



It follows from the proof of Theorem 6.1 that the first summand converges to zero and the second 



summand equals zero because of Theorem 4.2 a). So it suffices to treat the third summand which 
is bounded by 



2E [Dp / JVar(L>^J - D 2 / / JVarp^J + 2E (D 2 // JVar^fJ - D 2 //^v(D 2 /] 



We obtain from Theorem 6.1 a) and b) that a constant L > exists such that 



Var(L>^*J > LN sd ~- l T- 1 (6.31) 
[note the we are under the null hypothesis and that we therefore have d^ = d\. This combined 



with Theorem 6.2, (6.30) and the conditions on the growth rate on p = p{T) yields that we can 
restrict ourselves to the second term, which is [up to the constant 2] bounded by 



< 



Var(D?*)Var(Dy' 

n{D 2 T*?) 



2,*\ 



Var(£&*J - JVar(D%*) 



Vax(££*)Var 



Var(^;)-Var(^*) 



If we follow the proof of Theorem 6.1 a), b) and employ (6.28) and (6.30), we obtain that 



E((Z^'*) 2 ) = O(l O g(Af)iV max ( 8 ^- 1 ' ) +2a T- 1 + N 8 ^ +2a - 2 ) 



(6.32) 



holds for every fixed a > 0. By employing (6.27) and similar arguments as in the proof of Theorem 
16.21 we obtain thereafter 



Var(D 2 '; a l AT(a) ) - \ a r(D 2 /l AT(a) ) = 0(p 8 log(T) 3 l og (JV)2( iV niax(8d-l,0)+2a r -2 + 7V 8d-2+2a r -l )) _ 



The assertion then follows with ( 6.30 )-( 6.32) and the assumptions on the growth rate of p = p(T). 



□ 
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7 Appendix: Auxiliary Lemmas 



Finally we show some lemmas which were employed in the above proofs. 



Lemma 7.1. Suppose fi,v,a,b € JR. Then there exists a constant C £ JR such that the following 
holds: 



a) If fi, v > and b > a, then 
N-i 1 

E 

k=0 
k-a>l 
-k+b>l 



{k-a) 1 -^ (b-k) 1 



< 



b-1 

E 



-'■ ~ k ^ +a (k - a) 1 -^ (b - k) 1 -" 



< 



C 



b) If < and < 1 — fx — v, then it follows for |a + b\ > 
AT-1 , „ oo 



E 



1 



1 



< 



C 



^ (k + by-v (k - a) 1 -" ~ ^ (k + by-v (k - a) 1 -" ~ \a + b\ 1 ~v- v ' 

k+b>\ k+b>\ 
k—a>l k—a>l 



c) If < v < I — fi and y, z > 1, then 



E 

k=l+y 



log(fc) 1 



< C 



log(y) 



k 1 -^ (k-yy~ u ~ y l ~v~v' 



E 



log(fe + z) 1 < log(2;) 



' (k + zy-v k x - v ~ z 1 



— fl — V 



Proof: a) Using equation 3.196(3) in Gradshteyn and Ryzhik (1980), it follows that 
6-1 



E 



1 



< 



(k - ay-v (b - ky~ v ~ J a (x- ay-v (b - x) 1 



1 



dx< 



1 



a y-n-v 



b) If a + b > we can bound the middle sum in (7.2) by 

1 1 ~ 1 1_ 

^ (k + by-v(k- a y~ u - 2s (k + by-* (k - a y- 

fc=max{l,l-6,l+o} V ; V ' k=l+a ^ > ^ ' 



< 



1 



1 



dx < 



1 



a (x + by-^(x-a) l - u ~ (a + by-^- u 



(7.1) 



(7.2) 



(7.3) 
(7.4) 



The last inequality follows from the equations 3.196(2) and 3.191(2) [for choosing b = 0] in Grad- 



shteyn and Ryzhik (1980). On the other hand, if a + b < we have 



E 



1 



fc=max{l,l— 6,1+a} 

OO 



E 



+ 6) 1 -** (k-a) 1 - 

J 1 



< 



< 



E 



s_ b (k + by-» (k- a y- 



k=i+(-b) 



(k - (-b)y-» (k + (-a)) 1 -" ~ (-a - by~»~ 
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The last inequality follows with 



Gradshteyn and Ryzhik (1980) as above. 



c) We start with (ffj]). Using equation 13.2(18) in |Erdelyi| (|1954b[) yields 



E 

k=l+y 



log(fc) 



1 



k i-n (k-y) 1 - 



< 



log(x) 



1 



-.1 — fjL 



(x - y) 



l-v 



dx< 



log(y) 



Concerning (7.4) we use equation 6.4(23) in Erdelyi| ( 1954a) which implies 
^ log(fc + z) 1 < r logfr + g) 1 



k=l 

Lemma 7.2. If < < \, then 

M N-l 



log (a: + z) 1 dx < logCg) 
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j'i^j2 Di-toi+ri— si=0 r 1 +V2-x 1 -si-W2+yi-y2+X2=0 

0<|«i-wi|<JV-l 0<r 1 +v 2 -x 1 +(j 1 -j 2 )N<N-l 
0<s 1 +w 2 -yi + (j 1 -j2)N<N-l 
0<\y2-x 2 \<N-l 
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(N 2 - N\y 2 - x 2 \ - N\vi - w\ \ + \vi - wi\y 2 - x 2 \ 



O 



Mjv) 



Proof: Firstly, we set < w\ = r\ — s\ + v\ and < x 2 = s\ — r\ — v 2 + x\ + w 2 — y\ + y 2 . Then 
we define p := r\ + v 2 — x\ + {j\ — j 2 )N and rearrange to < x\ = n — p + v 2 + (ji — j2)N. Since 
p £ {0, 1, 2 . . . , iV — 1}, it follows that if p, n, v 2 , x\,ji are fixed, there are at most two possible 
values for j 2 . Hence it is enough to consider the following expression with 1 < |Ci| < M — 1 
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(r\ — s\ + v 1) 1 {r\ — p + v 2 + C\N) 1 d °° (s\ — p + w 2 + y 2 — y\ + CiA^) 1 da 
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(n - p + v 2 + CiN) 1 d °° {ri - p + W2 - yi + V2 + v\ + C\N) 1 2d ° 
MN 2 ^ In -p + Ci7V| 2 - M - ~ AfiV 2 ^ (n - p + iV) 2 - M 

p,ri=0 p,ri=0 



< log(iV) _ log(JV) 
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